Approximate Dynamic Programming: An Efficient Machine Learning Algorithm
نویسندگان
چکیده
We propose an efficient machine learning algorithm for two-stage stochastic programs. This is termed as projected hybrid algorithm, and consists of sub-gradient piecewise linear approximation methods. use the sample information to update on objective function. Then we introduce a projection step, which implemented methods, jump out from local optimum, so that can achieve global optimum. By innovative show convergent property general Furthermore, network recourse problem, our drop steps, but still maintains convergence property. Thus, if properly construct initial functions, pure method The proposed approximate dynamic programming overcomes high dimensional state variables using methods learning, its logic capture critical ability structure anticipate impact decisions now future. optimization framework, carefully calibrated against historical performance, make it possible changes in collective intelligence experienced decisions. Computational results indicate exhibits rapid convergence.
منابع مشابه
Efficient sampling in approximate dynamic programming algorithms
Dynamic Programming (DP) is known to be a standard optimization tool for solving Stochastic Optimal Control (SOC) problems, either over a finite or an infinite horizon of stages. Under very general assumptions, commonly employed numerical algorithms are based on approximations of the cost-to-go functions, by means of suitable parametric models built from a set of sampling points in the d-dimens...
متن کاملAn Approximate Dynamic Programming Algorithm for Monotone Value Functions
Authors are encouraged to submit new papers to INFORMS journals by means of a style file template, which includes the journal title. However, use of a template does not certify that the paper has been accepted for publication in the named journal. INFORMS journal templates are for the exclusive purpose of submitting to an INFORMS journal and should not be used to distribute the papers in print ...
متن کاملDynamic Programming for Approximate Expansion Algorithm
Expansion algorithm is a popular optimization method for labeling problems. For many common energies, each expansion step can be optimally solved with a min-cut/max flow algorithm. While the observed performance of max-flow for the expansion algorithm is fast, its theoretical time complexity is worse than linear in the number of pixels. Recently, Dynamic Programming (DP) was shown to be useful ...
متن کاملOptimal Learning and Approximate Dynamic Programming
Approximate dynamic programming (ADP) has emerged as a powerful tool for tackling a diverse collection of stochastic optimization problems. Reflecting the wide diversity of problems, ADP (including research under names such as reinforcement learning, adaptive dynamic programming and neuro-dynamic programming) has become an umbrella for a wide range of algorithmic strategies. Most of these invol...
متن کاملApproximate Dynamic Programming and Reinforcement Learning
Dynamic programming (DP) and reinforcement learning (RL) can be used to address problems from a variety of fields, including automatic control, artificial intelligence, operations research, and economy. Many problems in these fields are described by continuous variables, whereas DP and RL can find exact solutions only in the discrete case. Therefore, approximation is essential in practical DP a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Artificial intelligence
سال: 2023
ISSN: ['2633-1403']
DOI: https://doi.org/10.5772/intechopen.106691